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Abstract 

Background: The enteroaggregative E. coli heat-stable enterotoxin 1 (EAST1) encoded by astA gene has been 
found in enteropathogenic E. coli (EPEC) strains. However, it is not sufficient to simply probe strains with an astA 
gene probe due to the existence of astA mutants (type 1 and type 2 SHEAST) and EAST1 variants (EAST1 v1-4). In 
this study, 222 EPEC (70 typical and 152 atypical) isolates were tested for the presence of the astA gene sequence 
by PCR and sequencing. 

Results: The astA gene was amplified from 54 strains, 1 1 typical and 43 atypical. Sequence analysis of the PCR 
products showed that 25 strains, 7 typical and 18 atypical, had an intact astA gene. A subgroup of 7 atypical strains 
had a variant type of the astA gene sequence, with four non-synonymous nucleotide substitutions. The remaining 
22 strains had mutated astA gene with nucleotide deletions or substitutions in the first 8 codons. The RT-PCR results 
showed that the astA gene was transcribed only by the strains carrying either the intact or the variant type of the 
astA gene sequence. Southern blot analysis indicated that astA is located in EAF plasmid in typical strains, and in 
plasmids of similar size in atypical strains. Strains carrying intact astA genes were more frequently found in diarrheic 
children than in non-diarrheic children (p<0.05). 

Conclusion: In conclusion, our data suggest that the presence of an intact astA gene may represent an additional 
virulence determinant in both EPEC groups. 
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Background 

Enteropathogenic Escherichia coli (EPEC) are an import- 
ant cause of infant diarrhea in developing countries [1]. 
The majority of EPEC isolates belong to classic serotypes 
derived from 12 classical O serogroups (026, 055, 086, 
Olll, OH4, OH9, 0125, 0126, 0127, 0128, 0142, 
and 0158) [2,3]. EPEC induces attaching and effacing 
(A/E) lesions on epithelial cells, characterized by micro- 
villi destruction, cytoskeleton rearrangement, and the 
formation of a pedestal-like structure at the site of bac- 
terial contact [4], The A/E genes are localized to the locus 
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for enterocyte effacement (LEE) and encode intimin, a 
type III secretion system, secreted proteins and the trans- 
located intimin receptor [5-7]. 

"Typical" EPEC strains (tEPEC) contain also the EPEC 
adherence factor (EAF) plasmid [8], which carries genes 
encoding a regulator (per) [9] and the bundle-forming 
pili (BFP) [10]. EPEC strains lacking the EAF plasmid 
have been designated "atypical" EPEC (aEPEC) [11]. Re- 
cent epidemiological studies indicate that aEPEC are more 
prevalent than tEPEC in both developed and developing 
countries [1]. Some aEPEC strains are genetically related 
to the enterohemorrhagic E. coli (EHEC), and both are 
considered as emerging pathogens [12]. 

Typical EPEC strains express only the virulence factors 
encoded by the LEE region and the EAF plasmid, with 
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the exception of the cytolethal distending toxin pro- 
duced by 086:H34 strains and the enteroaggregative 
heat-stable enterotoxin 1 (EAST1) found in 055:H6 and 
0127:H6 strains. In contrast, aEPEC strains frequently 
express EAST1 and additional virulence factors not 
encoded by LEE region [12]. In a previous study [13], 
EAST1 was the most frequent (24%) virulence factor 
found in a collection of 65 aEPEC strains, and was sig- 
nificantly associated with children diarrhea. 

EAST 1 -positive aEPEC strains have been associated 
with outbreaks of diarrhea involving children and adults 
in the United State [14] and Japan [15]. However, it is 
not sufficient to simply probe strains with an astA gene 
probe due to the existence of EAST1 variants [16]. In 
one study, 100% of the 026, Olll, 0145, and 0157:H7 
enterohemorrhagic E. coli (EHEC) strains examined 
carried DNA sequences homologous to the EAST1 
gene (SHEAST) with two different mutation types. 
Type 1 SHEAST has 12 nucleotide non-synonymous 
substitutions including one in the initiation codon; 
type 2 SHEAST lacks the first 8 codons of EAST1 se- 
quence [16]. The focus of the study was to investigate 
the astA gene sequence present in tEPEC and aEPEC 
strains. The strains were collected in different cities of 
Brazil in different periods of time and in a previous 



study poor relatedness was observed by RAPD analysis 
of 118 strains belonging to this collection [13]. 

Results and discussion 

We examined 222 EPEC strains (70 typical and 152 
atypical) for the presence of the astA gene by PCR using 
primers that anneal to the 5' ends of the EAEC 042 astA 
gene sequence [16]. Those strains were isolated from 
diarrheic and non diarrheic Brazilian children in previous 
studies [17-20]. As shown in Table 1, 11 (16%) tEPEC and 
43 (28%) aEPEC strains were positive in the PCR assay. 
Among the aEPEC PCR-positive strains, 13 belonged to 
the 026 and OH9 serogroups. 

The 54 astA gene PCR products were sequenced. 
Twenty five strains, 7 tEPEC and 18 aEPEC, carried the 
DNA sequence identical to the EAST1 gene (042-type 
EAST1) (Figure 1). A subgroup of 7 aEPEC strains pre- 
sented a variant type of the 042-type EAST1 gene sequence, 
with four non-synonymous nucleotide substitutions. Nine 
other strains, including one typical, carried either the 
sequence identical to type 1 SHEAST (7 strains) or to 
type 2 SHEAST (two strains). The remaining 13 strains 
carried mutated sequences of the 042-type EAST1 (five 
strains), type 1 SHEAST (two strains) or type 2 SHEAST 
(six strains) genes. 



Table 1 EPEC-astA strains isolated from diarrheic and non-diarrheic children 



EPEC 



Serotype 



No. of strains (positive/total) 







Diarrheic children 


Non-diarrheic children 


Total of children 


tEPEC 


055:NM;HND 


0/13 


0/1 


0/14 




086:NM;H34 


0/2 


0 


0/2 




01 1 1:NM;H2,HND 


4/9 


0 


4/9 




0119:NM;H6;HND 


2/22 


0/3 


2/25 




0127:NM;H6 


0/1 


2/3 


2/4 




Other serotypes 3 


3/14 


0/2 


3/16 


Subtotal 




9/61 


2/9 


11/70 


aEPEC 


026:H11;HND 


6/10 


0/2 


6/12 




055:HND 


2/3 


1/2 


3/5 




0111:NM 


2/2 


1/2 


3/4 




0114:NM 


0 


0/1 


0/1 




0119:H2;HND 


7/9 


0/3 


7/12 




0126:NM 


0/1 


0 


0/1 




O127:NM;H40 


0/3 


0/1 


0/4 




0128:NM 


0/3 


0 


0/3 




0142:NM;H2 


1/8 


0 


1/8 




Other serotypes 6 


18/68 


5/34 


23/102 


Subtotal 




36/107 


7/45 


43/152 


Total 




45/168 


9/54 


54/222 



a 02:H2;H45; O101:H33; 0145:HND; 0157:HND; 0162:H33; ONT:H45; ONT:HND. 

b 04:HND; 015:HND 033:H6; 035:1-119; 037:HND; 049:HND; 061:HND; 063:HND; 079:HND; O85:H40; 096:HND; 098:HND; O101:NM; O103:NM; O105:H7; O108:H31; 
O109:H54; 0117:HND; 0132:HND; 0141:HND; 01523H2; 0156:H16; 0157:HND; 0167:H6; 0169:H6; 01 75:HND;ONT:NM; ONT:H18; ONT:HND. 
Note: NM, nommotile, ND, nondetermined, ONT, nontypeable. 
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EPEC strains 




Met 


Pro 


Ser 


Thr 


Gin 


Tyr 


lie 


Arg 


Arg 


Pro 


Ala 


Ser 


Ser 


Tyr 


Ala 


Ser 


Cys 


lie 


Trp 


Cys 


Ala 


Thr 


Ala 


Cys 


Ala 


Ser 


Cys 


His 


Gly 


Arg 


Thr 


Thr 


Lys 


Pro 


Ser 


Leu 


Ala 


Thr 


Stop 



ATG CCA TCA ACA CAG TAT ATC CGA AGG CCC GCA TCC AGT TAT GCA TCG TGC ATA TGG TGC GCA ACA GCC TGC GCT TCG TGT CAT GGA AGG ACT ACA AAG CCG TCA CTC GCG ACC TGA 



T4, T34, T35, T46, T47, T57, 
T70, A1, A3, A30, A46, A57 
A59, A62, A66, A68, A78, 
A90, A101, A102, A105, 
A115, A127, A140, A151 



A16, A23, A47, A120 
A122, A139, A152 



T43 



| . | ... | ... | ... | ... | . . | . . G | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | ... | 

A ATC GC CG C ****************** q j 

|..t|ga.|ag.|.tt| '.|..a|...'|...|...|...|. | . | ... | ' . | ... | . | .|. | . | A ..|...|...|».|...|...|...|. C .| ... | ... | . | . | ... | ... | ... |..t| .|. | .| 

TGA AAG TT . .C . . A ... CCA TC. T TA TGC AC . TT ********* * A . . . . C . . C . C 



T2, A14, A28, A63 



A84, A99, A147 



SHEAST1 ACG CTA TCA ACA CGG TGT ATC CGG AAG CCC GCA TCC AGC TGT GCA TCG TGC ATA TGG TGC GCA ACA GCC TGC GGT TCG TTT CCT GGA AGG ACT ACA AAG CCG TCA CCC GCG ACC TGA 



I'tttttttttttttl " 

*** *** *** CC. T A. CT. GTC 



A71.A81, A131 



SHEAST2 * * * * * * * * * * * * * * * * * * * * * * * * AGG CCC GCA TCC AGT TAT GCA TCG TGC ATA TGG TGC GCA ACA GCA TGC GCT TCG TGT CAT GGA AGG AAT ACA AAG CCG TCA * * * * CG CCC TGA 

l-l-l-l-l-l-l-l-l |...|...|...|...|...|...|...|...|...|...|...|..^ ... 



| | | | | | | * - 1 ^ | C I I I I I ^ | | | | | | | | | | | cagI | ^ | c | | |c= | | | | | | | | 

*** **G . C A 



Figure 1 Nucleotide sequences of the PCR products from tEPEC (T) and aEPEC (A) strains. The nucleotide sequences of the EAST1, 
SHEAST1 and SHEAST2 genes are shown for comparison. Identical nucleotides are shown as dots. Asterisks indicate the positions of nucleotide 
deletion. 



The expression of EAST1 was examined by RT-PCR 
and quantitative RT-PCR. The RT-PCR results showed 
that the astA gene was transcribed only by the strains 
carrying either the intact or the variant type of the astA 
gene sequence (Figure 2). The astA gene expression levels 
of the 32 RT-PCR positive strains (CT values ranged from 
20.3 ± 0.11 to 21.6 ± 0.04) were nearly identical to that of 
EAEC 042 strain (CT value 20.8 ± 0.01). 

Plasmids of the 54 PCR-positive strains were examined 
for astA gene presence by Southern blot hybridization 
with the astA probe. In 23 (42.6%) strains, a single copy 



of the astA gene was located to a large plasmid (Figure 3). 
In all the eleven tEPEC strains, the astA probe hybridized 
to the EAF plasmid as previously reported [21], and in 
twelve aEPEC the astA probe hybridized with large plas- 
mids of similar size. The plasmids of the remaining strains 
were astA probe negative. 

We previously reported that 24% of 65 aEPEC strains 
hybridized with a DNA probe for EAST1 [13]. Here, we 
analyzed by PCR a larger group of EPEC, including typical 
strains and found that 11 (16%) of 70 tEPEC and 43 (28%) 
of 152 aEPEC were astA positive. Sequence analysis of the 




-Mil bp 



Figure 2 Agarose gel electrophoresis of the RT-PCR products of representative strains of tEPEC (T) and aEPEC (A). EAEC 042 strain (C+) 
was used as positive control. Molecular size standard bands are at left. 
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42 KDa— 
24 KDa— 




Figure 3 Southern blot hybridization of the plasmids of tEPEC (T) and aEPEC (A) strains. (A) and (C) Hybridization results with the astA 
probe. (B) Hybridization results with the EAF probe. EAEC 042 and EPEC E2348/69 were used as positive controls (C+) for astA and EAF probes, 
respectively. The arrows in panels A and C indicate the pAA2 plasmid (65-MDa) for EAEC 042 strain and the arrow in panel B indicate the EAF 
plasmid (60-MDa) for EPEC E2348/69 strain. Molecular size standard bands are at left. 



PCR products showed that 7 (63.6%) of 11 tEPEC and 18 
(41.9%) of 43 aEPEC had an intact 042-type astA gene. 

As shown in Table 2, strains carrying intact astA gene 
were more frequently found in diarrheic children than in 
non-diarrheic children (p = 0.03, Fisher's exact test). 
However, we should point out that among the 222 
strains analyzed only 118 were collected from a case- 
control study [13]. 

The EAST1 gene family includes one major type of se- 
quence, i.e. the astA of EAEC strain 042 that is widely dis- 
tributed among different diarrheagenic E. coli strains 
[21-26] and four variant types of EAST1, i.e. the EASTlvl 
of EAEC 17-2 [21,22], EASTlv2 of EPEC Nl [21], and 
EASTlv3 and EASTlv4 of E. coli 0166:H15 [25]. 

In this study, a subgroup of aEPEC strains had a new 
variant type of EAST1 gene sequence that differed from 
those previously reported, and was denominated EASTlv5 



(Figure 4). The RT-PCR analysis showed that EASTlv5 
was transcribed to produce mRNA. However, more stud- 
ies are necessary to determine whether EASTlv5 is associ- 
ated with a functional polypeptide toxin. 

Conclusion 

In conclusion, our data suggest that the presence of an 
intact astA gene may represent an additional virulence 
determinant in both EPEC groups. 

Methods 

Bacterial strains 

The 222 EPEC strains examined in this study included 
176 strains isolated in 1999 to 2004 during an epidemio- 
logical study of acute diarrhea in children <2 years of age 
conducted in different regions of Brazil, and 46 strains 
isolated from children <5 years of age with diarrhea in 



Table 2 Sequences of the astA gene found in EPEC strains isolated from diarrheic and non-diarrheic children 



astA gene sequence 
type 

042-type EAST1 



EAST1 v5 
type 1 SHEAST 
type 2 SHEAST 
mutant 
Total 



N (%) of strains from: 



Diarrheic children 

24(14.3) 

6 (3.6) 
6 (3.6) 
2(1.2) 

7 

45 



Non-diarrheic children 

1 (1.8) a 

1 (1.8) 
1 (1.8) 
0 
6 
9 



Serogroup (n) 

09 (1), 033 (2), 01 08 (2), 01 1 1 (1), 01 19 (8), 0142 (1), 0152 (1), 
0157 (1), 0169 (1), 0ND(7) 

026 (1), 09 (1), 096 (1) 01 1 1 (1), 0141 (1), ONT (2) 
026 (1), 055 (1), 0103 (1), 0153 (1), OND (3) 
026 (1), 055(1) 

026 (3), 055 (1), 01 1 1 (5), 01 19 (1), 0127 (2), ONT (1) 



a p = 0.03; Fisher's exact test (diarrheic x non-diarrheic). 
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Strain 1 5 10 15 20 25 30 35 38 

EAEC 042 EAST1 ATG CCA TCA ACA CAG TAT ATC CGA AGG CCC GCA TCC AGT TAT GCA TCG TGC ATA TGG TGC GCA ACA GCC TGC GCT TCG TGT CAT GGA AGG ACT ACA AAG CCG TCA CTC GCG ACC TGA 

EAEC17-2 EAST1 v1 A 

EPECN1 EAST1 v2 A G 

96-127-23 chromosome EAST1v3 C A A 

96-127-23 plasmid EAST1v4 G 

|A16, A47, A102, A120 
|A122, A139, A152 

Figure 4 Nucleotide sequence of the EAST1 gene and its variants, including the new one described in this study. Identical nucleotides 
are shown as dots. 



Sao Paulo between 2002 to 2003 [17-20]. All strains 
were characterized as tEPEC or aEPEC by hybridization 
with eae and EAF probes and serotyped (Table 1). 

Ethics statement 

The study was approved by the ethics committee of the 
Universidade Federal de Sao Paulo, Brazil. Stool samples 
were obtained with the written informed consent from 
the parents or guardians of the children. 

PCR assays 

For template DNA preparation, three to five isolated 
bacterial colonies grown on LB agar plates were pooled, 
suspended in 300 \A of sterile distilled water, and boiled 
for 10 min. PCR was carried out in a total volume of 
25-[il containing 5 \i\ of template DNA. PCR primers 
were EAST13a (F-5AGAACTGCTGGGTATGTGGCT) 
located 110 nucleotides upstream from the initiation 
ATG sequence of the astA gene, and EAST12b (R- 
5'CTGCTGGCCTGCCTCTTCCGT) located 20 nucle- 
otides downstream from the stop TGA sequence of the 
astA gene [26]. Cycling conditions were denaturation 
for 30 s at 95°C, annealing for 120 s at 55°C, and 
polymerization for 120 s at 72°C (30 cycles). PCR prod- 
ucts were analyzed by 2% agarose gel electrophoresis. 

DNA hybridization 

The following probes were used in this study: astA, a 
Ill-bp PCR product from EAEC 042 strain with the pri- 
mer set EASTlla (5'-CCATCAACACAGTATTCCGA) 
and EAST12b (5'-GGTCGCGAGTGACGGCTTTGT) 
[26]; and EAF, a 1.0 kb BarnHI-Sall fragment from 
plasmid pMAR2 [27]. The DNA fragments were puri- 
fied, labeled with [a- 32 P] dCTP with a DNA labeling 
kit (Amersham Pharmacia Biotech Inc., EUA) and used 
as probes. For Southern blotting, plasmid DNA was ex- 
tracted using the method of Birnboim and Doly [28], 
separated in 0.8% agarose gel electrophoresis, and 
transferred to a nylon membrane, following a standard 
protocol [29]. Blots were hybridized in a solution con- 
taining the labeled probe (10 5 cpm), 5 x standard saline 
citrate (SSC), 2 x Denhardt s solution (Invitrogen), 
0.1% sodium dodecyl sulfate (SDS), and 5 mg/ml of sal- 
mon sperm DNA for 16 h at 65°C. After hybridization, 
washes were done in aqueous solution with 2 x SSC 
with 0.1% SDS and exposed to X-ray film. 



RNA extraction and RT-PCR assays 

Total RNA was extracted after bacterial growth in LB 
broth for 18 h at 37°C with the RNase Mini extraction 
kit (Qiagen) according to the manufacturers instruc- 
tions. After extraction, approximately 1 \ig of total RNA 
was digested with DNase I (Qiagen) for 30 min at 37°C, 
and the enzyme was then inactivated by adding 1 \A of 
25 mM EDTA and heating the solution at 65°C for 
10 min. To obtain the cDNA, the SperScript III One 
Step RT-PCR System with Platinum Taq DNA polymer- 
ase (Invitrogen) was used according to the manufac- 
turers specifications. Primers for 16S ribosomal protein 
were used to control PCR [30], and the assay was then 
carried out with the primers EASTlla and EASTllb 
[26]. PCR products were analyzed by 2% agarose gel 
electrophoresis. 

Quantitative PCR was performed in a Mastercycler ep 
realplex 4 (Eppendorf), and threshold cycle numbers were 
determined using Eppendorf realplex software (version 
2.0). Reactions were performed in triplicate, and thresh- 
old cycle numbers were averaged. The 50- (il reaction 
mixture was prepared as follows: 25 \A of Platinum® 
Quantitative PCR SuperMix-UDG (Invitrogen), 10 uM 
of the Taqman probe (5'FAM-TGCATCGTGCATATG 
GTGCGCAA) and 10 uM of each primer (R-5'GCG 
AGTGACGGCTTTGTAG and F-5'GAAGGCCCGCA 
TCCAGTT), and 10 ul of cDNA (100 ng). The reaction 
consisted of: 2 min at 48°C; 10 min at 95°C followed by 
40 cycles of 15 s at 95°C, 1 min at 60°C, and 1 min at 
72°C. The astA expression of the tested strains was 
compared to the astA expression of EAEC 042, accord- 
ing to the formula, 2 ( " AACt) [31]. 

DNA sequencing 

Nucleotide sequencing of the PCR products was per- 
formed at the Centro de Estudos do Genoma Humano- 
USP, Sao Paulo. Nucleotide sequence data were analyzed 
using SeqMan and MegAlign software and the BLAST 
tool (http://www.ncbi.nlm.nih.gov/BLAST). 

Statistical analysis 

Data for diarrheic and non diarrheic children were compared 
using a 2-tailed Chi-square test. Results with p values < 0.05 
were considered to be statistically significant. 
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Nucleotide sequence and accession number 

The EASTlv5 gene sequence was deposited in the NCBI 
database under accession number KJ47188. 
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